Modified MSP-1 nucleic acid sequences and methods for increasing MRNA levels and protein expression in cell systems

ABSTRACT

The invention provides modified recombinant nucleic acid sequences (preferably DNA) and methods for increasing the mRNA levels and protein expression of malarial surface protein MSP-1 which is known to be difficult to express in cell culture systems, mammalian cell culture systems, or in transgenic animals. The preferred protein candidates for expression using the recombinant techniques of the invention are MSP-1 proteins expressed from DNA coding sequences comprising reduced overall AT content or AT rich regions and/or mRNA instability motifs and/or rare codons relative to the native MSP-1 gene.

This application claims the benefit of a previously filed Provisional Application No. 60/085,649, filed May 15, 1998, and Provisional Application No. 60/062,592, filed Oct. 20, 1997, the contents of which are incorporated in their entirety.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The invention relates to heterologous gene expression. More particularly, the invention relates to the expression of malaria genes in higher eukaryote cell systems.

2. Summary of the Related Art

Recombinant production of certain heterologous gene products is often difficult in ill vitro cell culture systems or ill vivo recombinant production systems. For example, many researchers have found it difficult to express proteins derived from bacteria, parasites and virus in cell culture systems different from the cell from which the protein was originally derived, and particularly in mammalian cell culture systems. One example of a therapeutically important protein which has been difficult to produce by mammalian cells is the malaria merozoite surface protein (MSP-1).

Malaria is a serious heath problem in tropical countries. Resistance to existing drugs is fast developing and a vaccine is urgently needed. Of the number of antigens that get expressed during the life cycle of P. falciparum, MSP-1 is the most extensively studied and promises to be the most successful candidate for vaccination. Individuals exposed to P. falciparum develop antibodies against MSP-1, and studies have shown that there is a correlation between a naturally acquired immune response to NISP-1 and reduced malaria morbidity. In a number of studies, immunization with purified native MSP-1 or recombinant fragments of the protein has induced at least partial protection from the parasite (Diggs et al, (1993) Parasitol Today 9:300-302). Thus MSP-1 is an important target for the development of a vaccine against P. falciparum.

MSP-1 is a 190-220 kDA glycoprotein. The C-terminal region has been the focus of recombinant production for use as a vaccine. However, a major problem in developing MSP-1 as a vaccine is the difficulty in obtaining recombinant proteins in bacterial or yeast expression systems that are equivalent in immunological potency to the affinity purified native protein (Chang et al., (1992) J. Immunol. 148:548-555.) and in large enough quantities to make vaccine production feasible.

Improved procedures for enhancing expression of sufficient quantities of MSP-1 would be advantageous.

BRIEF SUMMARY OF THE INVENTION

The present invention provides improved recombinant DNA compositions and procedures for increasing the mRNA levels and protein expression of the malarial surface antigen MSP-1 in cell culture systems, mammalian cell culture systems, or in transgenic mammals. The preferred protein candidate for expression in an expression system in accordance with the invention is a C-terminal derivative of MSP-1 having a DNA coding sequence with reduced AT content, and eliminated mRNA instability motifs and rare codons relative to the recombinant expression systems. Thus, in a first aspect, the invention provides a DNA sequence derived from the sequence shown in SEQ ID NO 2. This derivative sequence is shown in SEQ ID NO 1.

In a second aspect, the invention provides a process for preparing a modified nucleic acid of the invention comprising the steps of lowering the overall AT content of the natural gene encoding MSP-1, eliminating all mRNA instability motifs and replacing all rare codons with a preferred codon of the mammary gland tissue, all by replacing specific codons in the natural gene with codons recognizable to, and preferably preferred by mammary gland tissue and which code for the same amino acids as the replaced codon. This aspect of the invention further includes modified nucleic acids prepared according to the process of the invention.

In a third aspect, the invention also provides vectors comprising modified MSP-1 nucleic acids of the invention and a goat beta casein promoter and signal sequence, and host cells transformed with nucleic acids of the invention.

In a fourth aspect, the invention provides transgenic non-human mammals whose germlines comprise a nucleic acid of the invention.

In a fifth aspect, the invention provides a DNA vaccine comprising a modified MSP-1 gene according to the invention.

DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts the cDNA sequence of MSP-1₄₂ modified in accordance with the invention [SEQ ID NO.: 1] in which 306 nucleotide positions have been replaced to lower AT content and eliminate mRNA instability motifs while maintaining the same protein amino acid sequence of MSP-1₄₂ [SEQ ID NO. 9]. The large letters indicate nucleotide substitutions.

FIG. 2 depicts the nucleotide sequence coding sequence of the “wild type” or native MSP-1₄₂ [SEQ ID NO.: 2] and its deduced amino sequence [SEQ ID NO.: 10].

FIG. 3(A-B) is a codon usage table for wild type MSP-1₄₂ (designated “MSP wt” in the table) and the new modified MSP-1₄₂ gene (designated “edited MSP” in the table) and several milk protein genes (casein genes derived from goats and mouse). The numbers in each column indicate the actual number of times a specific codon appears in each of the listed genes. The new MSP-1₄₂ synthetic gene was derived from the mammary specific codon usage by first choosing GC rich codons for a given amino acid combined with selecting the amino acids used most frequently in the milk proteins.

FIG. 4a-c depict MSP-1₄₂ constructs GTC 479, GTC 564, and GTC 627, respectively as are described in the examples.

FIG. 5 panel A is a Northern analysis wherein construct GTC627 comprises the new MSP-1₄₂ gene modified in accordance with the invention, GTC479 is the construct comprising the native MSP-1₄₂ gene, and construct GTC469 is a negative control DNA

FIG. 5 panel B is a Western analysis wherein the eluted fractions after affinity purifications numbers are collected fractions. The results show that fractions from GTC679 the modified MSP-1₄₂ synthetic gene construct reacted with polyclonal antibodies to MSP-1 and the negative control GTC479 did not.

FIG. 6 depicts the nucleic acid sequences of TO1 [SEQ ID NO 3], TO2 [SEQ ID NO 4], MSP-8 [SEQ ID ON 5], MSP-2 [SEQ ID NO 6] and MSP1 [SEQ ID NO 7] described in the Examples.

FIG. 7 is a schematic representation of plasmid BC574.

FIG. 8 is a schematic representation of BC620.

FIG. 9 is a schematic representation of BC670.

FIG. 10 is a representation of a Western blot of MSP transgenic milk.

FIG. 11 is a schematic representation of the nucleotide sequence of MSP42-2 [SEQ ID NO.: 8] and its deduced amino acid sequence [SEQ ID NO.: 11].

FIG. 12 is a schematic representation of the BC-718.

FIG. 13 is a representation of a Western blot of BC-718 expression in transgenic milk.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The patent and scientific literature referred to herein establishes the knowledge that is available to those with skill in the art. The issued US patents, allowed applications, published foreign applications, and references cited herein are hereby incorporated by reference. Any conflicts between these references and the present disclosure shall be resolved in favor of the present disclosure.

The present invention provides improved recombinant DNA compositions and procedures for increasing the mRNA levels and protein expression of the malarial surface antigen MSP-1 in cell culture systems, mammalian cell culture systems, or in transgenic mammals. The preferred protein candidate for expression in an expression system in accordance with the invention is a C-terminal derivative of MSP-1 having a DNA coding sequence with reduced AT content, and eliminated mRNA instability motifs and rare codons relative to the recombinant expression systems. Thus, in a first aspect, the invention provides a DNA sequence derived from the sequence shown in SEQ ID NO 2. This derivative sequence is shown in SEQ ID NO 1.

In preferred embodiments, the nucleic acid of the invention is capable of expressing MSP-1 in mammalian cell culture systems, or in transgenic mammals at a level which is at least 25%, and preferably 50% and even more preferably at least 100% or more of that expressed by the natural gene in mammalian cell culture systems, or in transgenic mammals under identical conditions.

As used herein, the term “expression” is meant mRNA transcription resulting in protein expression. Expression may be measured by a number of techniques known in the art including using an antibody specific for the protein of interest. By “natural gene” or “native gene” is meant the gene sequence, or fragments thereof (including naturally occurring allelic variations), which encode the wild type form of MSP-1 and from which the modified nucleic acid is derived. A “preferred codon “means a codon which is used more prevalently by the cell or tissue type in which the modified MSP-1 gene is to be expressed, for example, in mammary tissue. Not all codon chances described herein are changes to a preferred codon, so long as the codon replacement is a codon which is at least recognized by the mouse mammary tissue. The term “reduced AT content” as used herein means having a lower overall percentage of nucleotides having A (adenine) or T (thymine) bases relative to the natural MSP-1 gene due to replacement of the A or T containing nucleotide positions or A and/or T containing codons with nucleotides or codons recognized by mouse mammary tissue and which do not change the amino acid sequence of the target protein.

In a second aspect, the invention provides a process for preparing a modified nucleic acid of the invention comprising the steps of lowering the overall AT content of the natural gene encoding MSP-1, eliminating all mRNA instability motifs and replacing all rare codons with a preferred codon of mammary gland tissue, all by replacing specific codons in the natural gene with codons recognizable to, and preferably preferred by mammary gland tissue and which code for the same amino acids as the replaced codon. Standard reference works describing the general principals of recombinant DNA technology include Watson, J. D. et al, Molecular Biology of the Gene, Volumes I and II the Benjamin/Cummings Publishing Company, Inc. publisher, Menlo Park, Calif. (1987) Darnell, J. E. et al., Molecular Cell Biology, Scientific American Books, Inc., Publisher, New York, N.Y. (1986); Old, R. W., et al., Principles of Gene Manipulation: An Introduction to Genetic Engineering, 2d edition, University of California Press, publisher, Berkeley Calif. (1981); Maniatis, T., et al., Molecular Cloning: A Laboratory, Manual, 2^(nd) ed. Cold Spring Harbor Laboratory, publisher, Cold Spring Harbor, N.Y. (1989) and Current Protocols in Molecular Biology, Ausubel et al., Wiley Press, New York, N.Y. (1992). This aspect of the invention further includes modified nucleic acids prepared according to the process of the invention.

Without being limited to any theory, previous research has indicated that a conserved AU sequence (AUUUA) from the 3′ untranslated region of GM-CSF mRNA mediates selective mRNA degradation (Shaw, G. and Kamen, R. Cell 46:659-667). The focus in the past has been on the presence of these instability motifs in the untranslated region of a gene. The instant invention is the first to recognize an advantage to eliminating the instability sequences in the coding region of the MSP-1 gene.

In a third aspect, the invention also provides vectors comprising modified MSP-1 nucleic acids of the invention and a goat beta casein promoter and signal sequence, and host cells transformed with nucleic acids of the invention.

In a fourth aspect, the invention provides transgenic non-human mammals whose germlines comprise a nucleic acid of the invention. General principals for producing transgenic animals are known in the art. See for example Hogan et al., Manipulating the Mouse Embryo: A Laboratory Manual, Cold Spring Harbor Laboratory, (1986); Simons et al, Bio/Technology 6:179-183, (1988); Wall et al., Biol. Reprod. 32:645-651, (1985); Buhler et al., Bio/Technology, 8:140-143 (1990); Ebert et al., Bio/Technology 9:835-838 (1991); Krimenfort et al., Bio/Technology 9:844-847 (1991); Wall et al., J. Cell. Biochem. 49:113-120 (1992). Techniques for introducing foreign DNA sequences into mammals and their germ cells were originally developed in the mouse. See e.g., Gordon et al., Proc. Natl. Acad. Sci. USA 77:7380-7384, (1980); Gordon and Ruddle, Science 214: 1244-1246 (1981); Palmiter and Brinster, Cell 41: 343-345, 1985; Brinster et al., Proc Natl. Acad Sci., USA 82:4438-4442 (1985) and Hogan et al. (ibid.). These techniques were subsequently adapted for use with larger animals including cows and goats. Up until very recently, the most widely used procedure for the generation of transgenic mice or livestock, several hundred linear molecules of the DNA of interest in the form of a transgenic expression construct are injected into one of the pro- nuclei of a fertilized egg. Injection of DNA into the cytoplasm of a zygote is also widely used. Most recently cloning of an entire transgenic cell line capable of injection into an unfertilized egg has been achieved (KHS Campbell et al., Nature 380 64-66, (1996)).

The mammary gland expression system has the advantages of high expression levels, low cost, correct processing and accessibility. Known proteins, such as bovine and human alpha-lactalbumin have been produced in lactating transgenic animals by several researchers. (Wright et al, Bio/Technology 9:830-834 (1991); Vilotte et al, Eur. J. Biochem.,186:43-48 (1989); Hochi et at., Mol Reprod. And Devel. 33:160-164 (1992); Soulier et al., FEBS Letters 297(1,2):13-18 (1992)) and the system has been shown to produce high levels of protein.

In a fifth aspect, the invention provides a DNA vaccine comprising a modified MSP-1 gene according to the invention. Such DNA vaccines may be delivered without encapsulation, or they may be delivered as part of a liposome, or as part of a viral genome. Generally, such vaccines are delivered in an amount sufficient to allow expression of the modified MSP-1 gene and to elicit an antibody response in an animal, including a human, which receives the DNA vaccine. Subsequent deliveries, at least one week after the first delivery, may be used to enhance the antibody response. Preferred delivery routes include introduction via mucosal membranes, as well as parenteral administration.

EXAMPLE Creation of Novel Modified MSP-1₄₂ Gene

A novel modified nucleic acid encoding the C-terminal fragment of MSP-1 is provided. The novel, modified nucleic acid of the invention encoding a 42 kD C-terminal part of MSP-1 (MSP-1₄₂ ) capable of expression in mammalian cells of the invention is shown in FIG. 1. The natural MSP-1₄₂ gene (FIG. 2) was not capable of being expressed in mammalian cell culture or in transgenic mice Analysis of the natural MSP-1₄₂ gene suggested several characteristics that distinguish it from mammalian genes. First, it has a very high overall AT content of 76%. Second, the mRNA instability motif, AUUUA, occurred 10 times in this 1100 bp DNA segment (FIG. 2). To address these differences a new MSP-1₄₂ gene was designed. Silent nucleotide substitution was introduced into the native MSP-1₄₂ gene at 306 positions to reduce the overall AT content to 49.7%. Each of the 10 AUUUA mRNA instability motifs in the natural gene were eliminated by changes in codon usage as well. To change the codon usage, a mammary tissue specific codon usage table, FIG. 3a, was created by using several mouse and goat mammary specific proteins. The table was used to guide the choice of codon usage for the modified M5P-1₄₂ gene as described above. For example as shown in the Table in FIG. 3a, in the natural gene, 65% ({fraction (25/38)}) of the Leu was encoded by TTA, a rare codon in the mammary gland. In the modified MSP-1₄₂ gene, 100% of the Leu was encoded by CTG, a preferred codon for Leu in the mammary gland.

An expression vector was created using the modified MSP-1₄₂ gene by fusing the first 26 amino acids of goat beta-casein to the N-terminal of the modified MSP-1₄₂ gene and a SalI-Xho I fragment which carries the fusion gene was subcloned into the XhoI site of the expression vector pCDNA3. A His6 tag was fused to the 3′ end of the MSP-1₄₂ gene to allow the gene product to be affinity purified. This resulted in plasmid GTC627 (FIG. 4c).

To compare the natural MSP-1₄₂ gene construct to the modified MSP-1₄₂ nucleic acid of the invention, an expression vector was also created for the natural MSP-1₄₂ gene and the gene was added to mammalian cell culture and injected into mice to form transgenic mice as follows:

Construction of the Native MSP-1₄₂ Expression Vector

To secrete the truncated merozoite surface protein-1 (MSP-1) of Plasmodium falciparum, the wild type gene encoding the 42KD C-terminal part of MSP-1 (MSP-1₄₂) was fused to either the DNA sequence that encodes the first 15 or the first 26 amino acids of the oat beta-casein. This is achieved by first FCR amplify the MSP-1 plasmid (received from Dr. David Kaslow, NIH) with primers MSP1 and MSP2 (FIG. 6), then cloned the PCR product into the TA vector (Invitrogen). The BglII-XhoI fragments of the PCR product was ligated with oligos TO1 and TO2 (FIG. 6) into the expression vector pCDNA3. This yielded plasmid GTC564 (FIG. 4b), which encodes the 15 amino acid beta-casein signal peptide and the first 11 amino acids of the mature goat beta-casein followed by the native MSP-1₄₂ gene. Oligos MSF-8 and MSP-2 (FIG. 6) were used to amplify MSP-1 plasmid by FCR, the product was then cloned into TA vector. The XhoI fragment was exercised and cloned into the XhoI site of the expression vector pCDNA3 to yield plasmid GTC479 (FIG. 4a), which encoded 15 amino acid goat beta-casein signal peptide fused to the wild-type MSP-1₄₂ gene. A His6 tag was added to the 3′ end of MSP-1₄₂ gene in GTC 564 and CTC 479.

Native MSP-1₄₂ Gene is not Expressed in COS-7 Cells

Expression of the native MSP gene in cultured COS-7 cells was assayed by transient transfection assays. GTC479 and GTC564 plasmids DNA were introduced into COS-7 cells by lipofectamine (Gibco-BRL) according to manufacturer's protocols. Total cellular RNA was isolated from the COS cells two days post-transfection. The newly synthesized proteins were metabolically labeled for 10 hours by adding ³⁵S methionine added to the culture media two days-post transfection.

To determine the MSP mRNA expression In the COS cells, a Northern blot was probed with a ³²p labeled DNA fragment from GTC479. No MSP RNA was detected in GTC479 or GTC564 transfectants (data not shown). Prolong,ed exposure revealed residual levels of degraded MSP mRNA. The ³⁵S labeled culture supernatants and the lysates were immunoprecipitated with a polyclonal antibody raised against MSP. Immunoprecipitation experiments showed that no expression from either the lysates or the supernatants of the GTC479 or GTC564 transfected cells (data not shown). These results showed that the native MSP-1 gene was not expressed in COS cells.

Native MSP-1₄₂ Gene is not Expressed in the Mammary Gland of Transgenic Mice

The SalI-Xhot fragment of GTC479, which encoded the 15 amino acids of goat beta-casein signal peptide, the first 11 amino acids of goat beta-casein, and the native MSP-1₄₂ gene, was cloned into the XhoI site of the beta-casein expressed in vector BC350. This yielded plasmid BC574 (FIG. 7). A SalI-NotI fragment of BC574 was injected into the mouse embryo to generate transgenic mice. Fifteen lines of transgenic mice were established. Milk from the female founder mice was collected and subjected to Western analysis with polycolonal antibodies against MSP. None of the seven mice analyzed were found to express MSP-1₄₂ protein in their milk. To further determine if the mRNA of MSP-1₄₂ was expressed in the mammary gland, total RNA was extracted from day 11 lactating transgenic mice and analyzed by Northern blotting. No MSP-1₄₂ mRNA was detected by any of the BC 574 lines analyzed. Therefore, the MSP-1₄₂ transgene was not expressed in the mammary gland of transgenic mice. Taken together, these experiments suggest that native parasitic MSP-1₄₂ gene could not be expressed in mammalian cells, and the block is as the level of mRNA abundance.

Expression of MSP in the Mammalian Cells

Transient transfection experiments were performed to evaluate the expression of the modified MSP-1₄₂ gene of the invention in COS cells. GTC627 and CTC479 DNA were introduced into the COS-7 cells. Total RNA was isolated 48 hours post-transfection for Northern analysis. The immobilized RNA was probed with ³²P labeled SalI-XhoI fragment of GTC627. A dramatic difference was observed between GTC479 and GTC627. While no MSP-1₄₂ mRNA was detected in the GTC479 transfected cells as shown previously, abundant MSP-1₄₂ mRNA was expressed by GTC627 (FIG. 5, Panel A). GTC 469 was used as a negative control and comprises the insert of GTC564 cloned into cloning vector PU19, a commercially available cloning vector. A metabolic labeling experiment with ³⁵S methionine followed by immunoprecipitation with polyclonal antibody (provided by D. Kaslow NIAID, NIH) against MSP showed that MSP-1₄₂ protein was synthesized by the transfected COS cells (FIG. 5, Panel B). Furthermore, MSP-1₄₂ was detected in the transfected COS supernatant, indicating the MSP-1₄₂ protein was also secreted. Additionally, using Ni-NTA column, MSP-1₄₂ was affinity purified from the GTC627 transfected COS supernatant.

These results demonstrated that the modification of the parasitic MSP-1₄, gene lead to the expression of MSP mRNA in the COS cells. Consequently, the MSP-1₄₂ product was synthesized and secreted by mammalian cells.

Polyclonal antibodies used in this experiment may also be prepared by means well known in the art (Antibodies: A Laboratory Manual, Ed Harlow and David Lane, eds. Cold Spring Harbor Laboratory, publishers (1988)). Production of MSP serum antibodies is also described in Chang et at., Infection and Immunity (1996) 64:253-261 and Chang et al., (1992) Proc Natl. Acad. Sci. USA 86:6343-6347.

The results of this analysis indicate that the modified MSP-1₄₂ nucleic acid of the invention is expressed at a very high level compared to that of the natural protein which was not expressed at all. These results represent the first experimental evidence that reducing the AT % in a gene leads to expression of the MSP gene in heterologous systems and also the first evidence that removal of AUUUA mRNA instability motifs from the MSP coding region leads to the expression of MSP protein in COS cells. The results shown in FIG. 5, Panel A Northern (i.e. no RNA with native gene and reasonable levels with a modified DNA sequence in accordance with the invention), likely explains the increase in protein production.

The following examples describe the expression of MSP1-42 as a native non-fusion (and non-glycosylated) protein in the milk of transgenic mice.

Construction of MSP Transgene

To fuse MSP1-42 to the 15 amino acid β-casein signal peptide, a pair of oligos, MSP203 and MSP204 (MSP203: [SEQ ID NO.: 12]: ggccgctcgacgccaccatgaaggtcctcataattgcctgtctggtggctctggccattgcagccgtcactccctccgtcat, MSP204: [SEQ ID NO.: 13]: cgatgacggagggagtgacggctgcaatggccagagccaccagacaggcaattatgaggaccttcatggtggcgtcgagc), which encode the 15 amino acid casein signal and the first 5 amino acids of the MSP1-42 ending at the ClaI site, was ligated with a ClaI-XhoI fragment of BC620 (FIG. 8) which encodes the rest of the MSP1-42 gene, into the XhoI site of the expression vector pCDNA3. A XhoI fragment of this plasmid (GTC669) was then cloned into the XhoI site of milk specific expression vector BC350 to generate B670 (FIG. 9).

Expression of MSP1-42 in the Milk of Transgenic Mice

A SalI-NotI fragment was prepared from plasmid BC670 and microinjected into the mouse embryo to generate transgenic mice. Transgenic mice was identified by extracting mouse DNA from tail biopsy followed by PCR analysis using oligos GTC17 and MSP101 (sequences of oligos: GTC17 [SEQ ID NO.: 14], GATTGACAAGTAATACGCTGTTTCCTC, Oligo MSP101 [SEQ ID NO. 15], GGATTCAATAGATACGG). Milk from the female founder transgenic mice was collected at day 7 and day 9 of lactation, and subjected to western analysis to determine the expression level of MSP-1-42 using a polyclonal anti-MSP antibody and monoclonal anti MSP antibody 5.2 (Dr. David Kaslow, NIH). Results indicated that the level of MSP-1-42 expression in the milk of transgenic mice was at 1-2 mg/ml (FIG. 10).

Construction of MSP1-42 Glycosylation Sites Minus Mutants

Our analysis of the milk-produced MSP revealed that the transgenic MSP protein was N-glycosylated. To eliminate the N-glycosylation sites in the MSP1-42 gene, Asn (N) at positions 182 and 263 were substituted with Gln (Q). The substitutions were introduced by designing DNA oligos that anneal to the corresponding region of MSP1 and carry the AAC to CAG mutations. These oligos were then used as PCR primers to produce DNA fragments that encode the N to Q substitutions.

To introduce N263-Q mutation, a pair of oligos, MSPGYLYCO-3 [SEQ ID NO.: 16] (CAGGGAATGCTGCAGATCAGC) and MSP42-2 [SEQ ID NO.: 17] (AATTCTCGAGTTAGTGGTGGTGGTGGTGGTGATCGCAGAAAATACCATG, FIG. 11), were used to PCR amplify plasmid GTC627, which contains the synthetic MSP1-42 gene. The PCR product was cloned into pCR2.1 vector (Invitrogen). This generated plasmid GTC716.

To introduce N182-Q mutation, oligos MSPGLYCO-1 [SEQ ID NO.: 18] (CTCCTTGTTCAGGAACTTGTAGGG) and MSPGLCO-2 (GTCCTGCAGTACACATATGAG, FIG. 4) were used to amplify plasmid GTC627. The PCR product was cloned into pCR2.1. This generated plasmid GTC700.

The MSP double glycosylation mutant was constructed by the following three steps: first a Xho I-Bsm I 1fragment of BC670 and the Bsm I-Xho I fragment of GTC716 is ligated into the Xho I site of vector pCR2.1. This resulted a plasmid that contain the MSP-1-42 gene with N262-Q mutation. EcoN I-Nde I fragment of this plasmid was then replaced by the EcoN I-Nde I fragment from plasmid GTC716 to introduce the second mutation N181-Q. A Xho I fragment of this plasmid was finally cloned into BC350 to generate BC718 (FIG. 12).

Transgenic Expression Nonglycosylated MSP-1

BC718 has the following characteristics: it carries the MSP1-42 gene under the control of the β-casein promoter so it can be expressed in the mammary gland of the transgenic animal during lactation. Further, it encodes a 15 amino acid β-casein leader sequence fused directly to MSP1-42 so that the MSP1-42, without any additional amino acid at its N-terminal, can be secreted into the milk. Finally, because the N-Q substitutions, the MSP produced in the milk of the transgenic animal by this construct will not be N-glycosylated. Taken together, the transgenic MSP produced in the milk by BC718 is the same as the parasitic MSP.

A SalI/XhoI fragment was prepared from plasmid BC718 and microinjected into mouse embryos to generate transgenic mice. Transgenic animals were identified as described previously. Milk from female founders was collected and analyzed by Western blotting with antibody 5.2. The results, shown in FIG. 13 indicate expression of nonglycosylated MSP1 at a concentration of 0.5 to 1 mg/ml.

8 1 1065 DNA preferably, a bacterium, virus, or parasite 1 gccgtcactc cctccgtcat cgataacatc ctgtccaaga tcgagaacga gtacgaggtg 60 ctgtacctga agccgctggc aggggtctac cggagcctga agaagcagct ggagaacaac 120 gtgatgacct tcaacgtgaa cgtgaaggat atcctgaaca gccggttcaa caagcgggag 180 aacttcaaga acgtgctgga gagcgatctg atcccctaca aggatctgac cagcagcaac 240 tacgtggtca aggatcccta caagttcctg aacaaggaga agagagataa gttcctgagc 300 agttacaact acatcaagga tagcattgat accgatatca acttcgccaa cgatgtcctg 360 ggatactaca agatcctgtc cgagaagtac aagagcgatc tggattcaat caagaagtac 420 atcaacgata agcagggaga gaacgagaag tacctgccct tcctgaacaa catcgagacc 480 ctgtacaaga ccgtcaacga taagattgat ctgttcgtga tccacctgga ggccaaggtc 540 ctgaactaca catatgagaa gagcaacgtg gaggtcaaga tcaaggagct gaattacctg 600 aagaccatcc aggataagct ggccgatttc aagaagaaca acaacttcgt cgggatcgcc 660 gatctgagca ccgattacaa ccacaacaac ctgctgacca agttcctgag caccggtatg 720 gtcttcgaaa acctggccaa gaccgtcctg agcaacctgc tggatgggaa cctgcagggg 780 atgctgaaca tcagccagca ccagtgtgtg aagaagcagt gtccccagaa cagcgggtgt 840 ttcagacacc tggatgagag agaggagtgt aagtgtctgc tgaactacaa gcaggaaggt 900 gataagtgtg tggaaaaccc caatcctact tgtaacgaga acaatggtgg atgtgatgcc 960 gatgccaagt gtaccgagga ggattcaggg agcaacggga agaagatcac ctgtgagtgt 1020 accaagcctg attcttatcc actgttcgat ggtatcttct gtagt 1065 2 1088 DNA preferably, a bacterium, virus, or parasite 2 gcagtaactc cttccgtaat tgataacata ctttctaaaa ttgaaaatga atatgaggtt 60 ttatatttaa aacctttagc aggtgtttat agaagtttaa aaaaacaatt agaaaataac 120 gttatgacat ttaatgttaa tgttaaggat attttaaatt cacgatttaa taaacgtgaa 180 aatttcaaaa atgttttaga atcagattta attccatata aagatttaac atcaagtaat 240 tatgttgtca aagatccata taaatttctt aataaagaaa aaagagataa attcttaagc 300 agttataatt atattaagga ttcaatagat acggatataa attttgcaaa tgatgttctt 360 ggatattata aaatattatc cgaaaaatat aaatcagatt tagattcaat taaaaaatat 420 atcaacgaca aacaaggtga aaatgagaaa taccttccct ttttaaacaa tattgagacc 480 ttatataaaa cagttaatga taaaattgat ttatttgtaa ttcatttaga agcaaaagtt 540 ctaaattata catatgagaa atcaaacgta gaagttaaaa taaaagaact taattactta 600 aaaacaattc aagacaaatt ggcagatttt aaaaaaaata acaatttcgt tggaattgct 660 gatttatcaa cagattataa ccataataac ttattgacaa agttccttag tacaggtatg 720 gtttttgaaa atcttgctaa aaccgtttta tctaatttac ttgatggaaa cttgcaaggt 780 atgttaaaca tttcacaaca ccaatgcgta aaaaaacaat gtccacaaaa ttctggatgt 840 ttcagacatt tagatgaaag agaagaatgt aaatgtttat taaattacaa acaagaaggt 900 gataaatgtg ttgaaaatcc aaatcctact tgtaacgaaa ataatggtgg atgtgatgca 960 gatgccaaat gtaccgaaga agattcaggt agcaacggaa agaaaatcac atgtgaatgt 1020 actaaacctg attcttatcc acttttcgat ggtattttct gcagtcacca ccaccaccac 1080 cactaact 1088 3 88 DNA preferably, a bacterium, virus, or parasite 3 tcgacgagag ccatgaaggt cctcatcctt gcctgtctgg tggctctggc cattgcaaga 60 gagcaggaag aactcaatgt agtcggta 88 4 88 DNA preferably, a bacterium, virus, or parasite 4 gatctaccga ctacattgag ttcttcctgc tctcttgcaa tggccagagc caccagacag 60 gcaaggatga ggaccttcat ggctctcg 88 5 60 DNA preferably, a bacterium, virus, or parasite 5 taactcgagc gaaccatgaa ggtcctcatc cttgcctgtc tggtggctct ggccattgca 60 6 48 DNA preferably, a bacterium, virus, or parasite 6 aattctcgag ttagtggtgg tggtggtggt gactgcagaa ataccatc 48 7 31 DNA preferably, a bacterium, virus, or parasite 7 aatagatctg cagtaactcc ttccgtaatt g 31 8 1142 DNA preferably, a bacterium, virus, or parasite 8 atgaaggtcc tcataattgc ctgtctggtg gctctggcca ttgcagccgt cactccctcc 60 gtcatcgata acatcctgtc caagatcgag aacgagtacg aggtgctgta cctgaagccc 120 ctggcaggag tctacaggag cctgaagaag cagctggaga acaacgtgat gaccttcaac 180 gtgaacgtga aggatatcct gaacagcagg ttcaacaaga gggagaactt caagaacgtg 240 ctggagagcg atctgatccc ctacaaggat ctgaccagca gcaactacgt ggtcaaagat 300 ccctacaagt tcctgaacaa ggagaagaga gataagttcc tgagcagtta caattacatc 360 aaggatagca ttgacaccga tatcaacttc gccaacgatg tcctgggata ctacaagatc 420 ctgtccgaga agtacaagag cgatctggat agcatcaaga agtacatcaa cgataagcag 480 ggagagaacg agaagtacct gcccttcctg aacaacatcg agaccctgta caagaccgtc 540 aacgataaga ttgatctgtt cgtgatccac ctggaggcca aggtcctgca gtacacatat 600 gagaagagca acgtggaggt caagatcaag gagctgaatt acctgaagac catccaggat 660 aagctggccg atttcaagaa gaacaacaac ttcgtcggaa tcgccgatct gagcaccgat 720 tacaaccaca acaacctgct gaccaagttc ctgagcaccg gaatggtctt cgaaaacctg 780 gccaagaccg tcctgagcaa cctgctggat ggaaacctgc agggaatgct gcagatcagc 840 cagcaccagt gtgtgaagaa gcagtgtccc cagaacagcg gatgcttcag acacctggat 900 gagagggagg agtgcaagtg cctgctgaac tacaagcagg aaggagataa gtgtgtggaa 960 aaccccaatc ctacttgtaa cgagaacaat ggaggatgcg atgccgatgc caagtgtacc 1020 gaggaggatt caggaagcaa cggaaagaag atcacctgcg agtgtaccaa gcctgattct 1080 tatccactgt tcgatggtat tttctgcagt caccaccacc accaccacta actcgaggat 1140 cc 1142 

What is claimed is:
 1. A modified nucleic acid sequence comprising SEQ ID NO
 1. 2. A nucleic acid sequence comprising a modified nucleic acid sequence encoding a merozoite surface protein 1 (MSP-1) operably linked to a promoter which directs expression in mammary epithelial cells wherein the AT content of the modified nucleic acid sequence has been reduced by replacing codons from a nucleic acid sequence of SEQ ID NO:2 with mammary gland preferred codons encoding the same amino acid as the replaced codon such that the AT content of the modified nucleic acid sequence is 50% or less.
 3. A vector comprising a modified merozoite surface protein 1 (MSP-1) nucleic acid according to clam
 2. 4. The vector of claim 3, wherein the promoter is a beta casein promoter.
 5. A host cell transformed with the vector according to claim
 3. 6. The cell of claim 5, wherein the cell is a mammalian cell.
 7. The cell of claim 6, wherein the cell is a COS cell.
 8. The cell of claim 6, wherein the cell is a mammary gland epithelial cell.
 9. The nucleic acid of claim 2, wherein the modified nucleic acid comprises at least one additional mammary gland preferred codon other than the mammary gland preferred codons which lowers the AT content.
 10. A nucleic acid sequence comprising a modified nucleic acid sequence encoding a merozoite surface protein 1 (MSP-1) operably linked to a promoter which directs expression in mammary epithelial cells wherein the AT content of the modified nucleic acid sequence has been reduced by replacing codons from a nucleic acid sequence of SEQ ID NO:2 with mammary gland preferred codons such that the AT content of the modified nucleic acid sequence is 50% or less and, wherein at least one glycosylation site of SEQ ID NO:10 has been altered such that it is not functional.
 11. The nucleic acid of claim 10, wherein the glycosylation site which is not functional is at position 182 of SEQ ID NO:10.
 12. The nucleic acid of claim 10, wherein the glycosylation site which is not functional is at position 263 of SEQ ID NO:10.
 13. The nucleic acid of claim 9, wherein the promoter is a beta casein promoter.
 14. The nucleic acid of claim 10, wherein the promoter is a beta casein promoter.
 15. A nucleic acid comprising a modified nucleic acid encoding merozoite surface protein 1 (MSP-1) operably linked to a promoter which directs expression in mammary epithelial cells, wherein all mRNA instability motifs present in the wild-type nucleic acid sequence encoding MSP-1 have been eliminated by replacing codons of SEQ ID NO:2 with mammary gland preferred codons encoding the same amino acid as the replaced codon.
 16. The nucleic acid of claim 15, wherein the promoter is a beta casein promoter.
 17. The nucleic acid of claim 15, wherein the modified nucleic acid comprises at least one additional mammary gland preferred codon other than the codon replaced to eliminate an mRNA instability motif.
 18. The nucleic acid of claim 15, wherein all of the codons of the modified nucleic acid are mammary gland preferred codons.
 19. A nucleic acid comprising a modified nucleic acid encoding merozoite surface protein 1 (MSP-1) operably linked to a promoter which directs expression in mammary epithelial cells, wherein all mRNA instability motifs present in the wild-type nucleic acid sequence encoding MSP-1 have been eliminated by replacing codons of SEQ ID NO:2 with mammary gland preferred codons, and wherein at least one glycosylation site of SEQ ID NO:10 has been altered such that it is not functional.
 20. The nucleic acid of claim 19, wherein the glycosylation site which is not functional is at position 182 of SEQ ID NO:10.
 21. The nucleic acid of claim 19, wherein the glucosylation site which is not functional is at position 263 of SEQ ID NO:10.
 22. The nucleic acid of claim 19, wherein the glycosylation sites at positions 182 and 263 of SEQ ID NO:10 are not functional. 